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ABSTRACT 


Predicting the arrival times of coronal mass ejections (CMEs) and their related waves at Earth is an important 
aspect of space weather forecasting. The Shock Propagation Model (SPM) and its updated version (SPM2), which 
use the initial parameters of solar flare-Type II burst events as input, have been developed to predict the shock 
arrival time. This paper continues to investigate the influence of solar disturbances and their associated CMEs on 
the corresponding interplanetary (IP) shock’s arrival at Earth. It has been found that IP shocks associated with 
wider CMEs have a greater probability of reaching the Earth, and the CME speed obtained from 
coronagraph observations can be supplementary to the initial shock speed computed from Type II radio bursts 
when predicting the shock’s arrival time. Therefore, the third version of the model, i.e., SPM3, has been developed 
based on these findings. The new version combines the characteristics of solar flare-Type II events with the initial 
parameters of the accompanying CMEs to provide the prediction of the associated IP shock’s arrival at Earth. The 
prediction test for 498 events of Solar Cycle 23 reveals that the prediction success rate of SPM3 is 70%-71%, 
which is apparently higher than that of the previous SPM2 model (61%-63%). The transit time prediction error of 
SPM3 for the Earth-encountered shocks is within 9 hr (mean-absolute). Comparisons between SPM3 and other 
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similar models also demonstrate that SPM3 has the highest success rate and best prediction performance. 
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1. INTRODUCTION 


As one of the most violent forms of solar activity, coronal 
mass ejections (CMEs) can lead to non-recurrent geomagnetic 
storms when they collide with Earth’s magnetosphere if they 
contain a long and strong southward magnetic field component 
(Gosling et al. 1991; Dryer 1994; Green & Baker 2015). They 
are thought to be major sources of extreme space weather 
events. Fast CMEs usually drive an interplanetary (IP) shock 
ahead of them. IP shocks can accelerate particles in their 
propagation medium and produce large solar energetic particle 
events (Gopalswamy et al. 2003; Cliver & Ling 2009), 
compress the magnetosphere, and produce corresponding 
effects (Green & Baker 2015) when they reach the Earth’s 
orbit. Type II bursts are the evidence of these shock waves as 
they propagate in the solar corona and IP space. A review of 
the dynamic processes of CMEs in IP space is provided by 
Wang et al. (2013). Due to the important space weather effects 
of CMEs and their associated shocks, predicting their arrival 
times at Earth’s orbit with sufficient lead time has been a 
significant aspect of space weather prediction. Scientists have 
developed various kinds of models for the CME/shock arrival 
time prediction (e.g., Feng et al. 2009b, 2009a). Details of the 
principles of these models, as well as their current prediction 
status, can be found in the recent review paper of Zhao & 
Dryer (2014). 

We have developed Shock Propagation Models, i.e., SPM 
(Feng & Zhao 2006) and SPM2 (Zhao & Feng 2014), to predict 
the shock arrival time at Earth. These models are based on an 
analytical solution to the propagation of blast waves in a 
moving medium. The inputs include the solar source location 
of the disturbance, the proxy piston driving time duration, the 
initial shock speed, and the background solar wind speed. The 
outputs determine whether or not the shock will hit Earth 
(SPM2) and the corresponding arrival time if it does (SPM, 


SPM2). Prediction results for 551 events of Solar Cycle 23 
reveal that the prediction accuracies of SPM2 are similar to the 
Fearless Forecast models including STOA (Dryer & 
Smart 1984; Smart et al. 1984; Smart & Shea 1985), ISPM 
(Smith & Dryer 1990, 1995), and HAFv.2 (Dryer 
et al. 2001, 2004; Fry et al. 2001, 2003, 2007; McKenna- 
Lawlor et al. 2002, 2006, 2012; Sun et al. 2002a, 2002b, 2003; 
Smith et al. 2005, 2009). The prediction success rate of SPM2 
is ~ 60% for both the Earth-encountered and Earth-missed 
shocks. The prediction errors of the arrival time for Earth- 
encountered shocks are about 12hr (rms error) and 10hr 
(mean-absolute; MA error). SPM2 can now be freely accessed 
and used online (http: //www.spaceweather.ac.cn /groupmodel. 
php?group=sigma). 

However, it is difficult to further improve the prediction 
accuracy of SPM2. One of the most important reasons for this 
difficulty is that the model neglects characteristics of the 
associated CMEs in its inputs. Like the Fearless Forecast 
models, the shock’s initial speed adopted in SPM2 is computed 
from the frequency drift rate of metric Type II radio bursts. 
This speed is often different from the CME speed remote- 
sensing observed by coronagraphs on board spacecraft (e.g., 
SOHO/LASCO and STEREO/SECCHI). The other kinematic 
parameters of CMEs, such as angular width, are also not 
considered in the model. More and more evidence demonstrate 
that CMEs are closely related to their associated IP shocks. 
Therefore, the kinematic characteristics of CMEs should be 
considered in the arrival time prediction for the associated 
shocks. This is the goal of this paper. 


2. SOLAR-IP SHOCK EVENTS AND THEIR 
ASSOCIATED CMEs 


Zhao & Feng (2014) studied 551 solar disturbance events 
during the period 1997 February—2006 December. These 
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events were taken from the Fearless Forecast publications (Fry 
et al. 2003; McKenna-Lawlor et al. 2006; Smith et al. 2009), 
with the exception of some interacting events and questionable 
events, covering nearly the whole Solar Cycle 23. The 
following parameters for each event were used in Zhao & 
Feng (2014): the start time of the Type II burst, the source 
location of the associated flare, the coronal shock speed 
(computed from the Type II radio burst drift rate), the proxy 
piston driving time duration, the solar wind speed in the 
background, and the arrival time of the associated IP shock at 
L1 if it reached Earth. The corresponding IP shocks were 
observed at Earth for 202 events and were referred to as “with- 
shock” (abbreviated as “W-shock”) events; the other 349 
events had no IP shocks observed at Earth and were called 
“without shock” (abbreviated as “W/O-shock”) events. 

We add information from the CMEs associated with these 
551 solar-IP shock events. For each event, we check the CME 
catalog observed by SOHO/LASCO (http: //cdaw.gsfc.nasa. 
gov/CME._list/) to find its accompanying CME. The CME that 
satisfies the following requirements is believed to be associated 
with the solar-IP shock event: (1) the first appearance time of 
the CME in LASCO/C2 is within +2 hr of the start time of the 
Type II burst; (2) the CME’s main propagation direction in 
LASCO’s observation is consistent with the source location of 
the solar-IP shock event; (3) if multiple CMEs satisfy (1) and 
(2), then we select the most eligible one. The Wind/WAVES 
Type II bursts and their associated CME (http://cdaw.gsfc. 
nasa.gov /CME_list/radio/waves_type2.html) catalog is also 
used for reference during our identification. For 202 “W- 
shock” events, there are 20 events during which LASCO/C2 
had data gaps, and thus there is no evidence of associated 
CMEs. Therefore, we exclude these 20 “W-shock” events from 
this study. Similarly, 33 “W/O-shock” events which occured 
when LASCO/C2 had data gaps are also excluded from this 
study. In this way, we obtain 498 solar-IP shock events as our 
event example, which includes 182 “W-shock” events and 316 
“W /O-shock” events. 

For the 182 “W-shock” events, 171 are associated with 
CMEs observed by SOHO/LASCO; the other 11 events are not 
associated with CMEs. Meanwhile, for 316 “W/O-shock” 
events, 269 are associated with CMEs observed by SOHO/ 
LASCO; the other 47 events are not associated with CMEs. 
Therefore, the total sample of 498 events can be re-divided into 
four groups: Group I (GP I): 171 Type II-CME events with 
corresponding IP shocks at Earth; Group II (GP ID): 11 Type H- 
non-CME events with their IP shocks at Earth; Group IH (GP 
Il): 269 Type II-CME events with no IP shock at Earth; and 
Group IV (GP IV): 47 Type II-non-CME events with no IP 
shock at Earth. 

Figure 1 shows the number frequency distribution of 440 
Type II-CME events (GP I + GP III) along the time difference 
(Atstart) between the start of the Type II burst and the launch of 
the CME in LASCO/C2. Here, Atstart > O indicates that the 
Type II start is earlier than the CME launch, while Afar < 0 
indicates that the CME launch is earlier than the Type II start. 
This demonstrates that the Type II burst started before the CME 
launch (Afgan: > 0) for 404 of the total events. The normal 
distribution of the event number is centered at 0.46 hr, which is 
also the average of Afgtar for the total 440 events, as shown by 
the vertical arrow. The delay of the CME launch is, at least 
partly, due to the different detecting principles for Type IIs and 
CMEs. The launch time of a CME adopted here is its first 
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Figure 1. Number frequency distribution of 440 Type II-CME events (GP I + 
GP III) along the time difference (Atstat) between the start of a Type II burst 
and the launch of the CME in LASCO/C2. Atsat > 0 indicates that the Type II 
start is earlier than CME launch, while Atstat < 0 indicates that the CME 
launch is earlier than the Type II start. The vertical dotted line indicates that 
Atstartt = 0, and the arrow represents the average of Atstart- 
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appearance in LASCO/C2. CMEs have propagated to a 
distance of about 2 to ~3 solar radii before they are detected 
by LASCO/C2. Therefore, this “launch time” is actually later 
than the triggering time of the eruption. On the other hand, the 
start time of a Type II burst, which corresponds to the 
formation time of a shock wave, is the time of energy release. 
This timing is near the triggering moment of a solar eruption, 
and should be earlier than the former. As a consequence, the 
start time of the Type II burst is adopted as the beginning of a 
solar event in the Fearless Forecast models. 


3. THE SHOCK PROPAGATION MODELS 
3.1. SPM 


Feng & Zhao (2006) established the first version of SPM. 
This model was based on the combination of an analytic 
solution to the propagation of blast waves (Wei 1982; Wei & 
Dryer 1991) and the empirical estimation method of shock 
energy (Smith & Dryer 1990, 1995). The analytic solution of 
this non-similarity theory for blast waves started from basic 
equations of ideal fluid dynamics under a spherically 
symmetric hypothesis. As opposed to the classical similarity 
theory, this solution includes a moving, steady-state medium 
with variable density. “Steady-state” means that the back- 
ground medium is in a state of dynamical equilibrium, and all 
of the parameters within it do not vary with time. The energy 
released from a point blast into the background medium is 
believed to be constant. After a series of complicated 
manipulations, the following equation is derived for the 
propagation speed (V,) of the wave front along the radial 
distance R (in units of AU): 


dR 2, Eo 1 
Y= — =| -2A,.4+ [(21)° + — + — ļuo. 1 
| i i 1) JoR mn "y 


Here, uo is the background solar wind speed, Jọ = $, 

Ar = —0.1808; Eo represents the dimensionless form of the 

total energy (E,) of the blast Ey = =, and A = 300kg m™'. 
ug 
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The integral of Equation (1) provides the shock’s transit time 


(TT) (T) to R: 
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TTo is computed by the requirement R = 0 when TT = 0. The 
shock’s total energy cannot be directly observed, requiring that 
a proxy be used. In the ISPM model, the shock’s total energy is 
believed to be proportional to its kinetic energy flux (v2), 
longitudinal width (w), and the duration time (7) of the initial 
pulse: 


E, = C. V2. w. (T + D). (3) 


Here, C = 0.283 x 10°°ergm *s 7 deg ', D = 0.52 hr, and 
an average of angular width wa = 60° is adopted. The 
combination of Equations (1)—(3) constitutes the SPM model. 
The input parameters of SPM include the initial shock speed 
(V,,;) computed from the Type II radio burst drifting speed, the 
duration time (7) of the associated X-ray flare, and the 
background solar wind speed (uo). The outputs of SPM provide 
the predicted TT of the shock to any radial distance R assuming 
that the shock will be able to reach R. 


3.2. SPM2 


Although capable of predicting the shock’s arrival time, the 
SPM model cannot judge whether or not an IP shock will hit 
the Earth. The contribution of the shock’s propagation direction 
to its arrival time is also neglected in SPM. The application of 
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the model to actual cases often leads to much faster predicted 
propagations of the shock than are observed. In addition, the 
initial shock speed computed from Type II radio bursts usually 
contains observational uncertainties due to the lack of spatial 
information. In order to overcome these drawbacks, Zhao & 
Feng (2014) put forward the second version of the model, i.e., 
SPM2. In SPM2, the following equation is adopted to compute 
the shock speed at a distance R: 


R E 1 
v=2 = }-2 + Jo +-+ z [e 


x Fip x Fpp (4) 


with Fap = 1.244—6.28 x 10 4u, Fpp = 0.85 + 0.2 
cos 6 cos ġ, V= (0.3818—3.0 x 1074V; sin 0) Vit+Vei, Eo = 
C. VÆ .w. (T+ D) 
Aug 

acceleration/deceleration and the propagation direction, 
respectively. 0 denotes the source longitude and @ denotes 
the source latitude of the shock. VŠ is the initial speed of the 
shock adjusted from V,;. The shock speed at the Earth’s 
location (V,(EL)) can be computed from Equation (4), and then 
the Equivalent Shock Strength Index (ESSI) of the shock is 


V (EL) — uo 
Vj 7 


. Here, Fap and Fpp are the correction terms for 


ESSI = (5) 
Here, V; is the fast-mode wave speed of the background solar 
wind at Earth, which is taken to be 100km s™!. If ESSI > 
ESSI,, = 2.29, then SPM2 predicts that the shock will reach the 
Earth and the corresponding TT at Earth is computed from 


TT = Jo {4A [R + 2Eo 
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Otherwise, SPM2 predicts that the shock will miss Earth when 
ESSI < ESSI,, = 2.29. The input parameters of SPM2 are the 
same as those of SPM with the addition of the shock’s source 
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Table 1 

The Mean Value of the Initial Parameters for the Four Group Events 
Mean Value GPI(171) GPH(11) GPM (269) GPTV (47) 
Va (kms!) 1231 943 980 953 
7 (hr) 1.81 0.81 1.16 0.92 
Veme (km s7’) 1108 z 672 i 
AWeme (°) 262 ve 142 
Zcme (ms 7) —8.72 e —8.84 ve 
Vew (km s~') 456 437 462 451 
wv ©) 43.3 25.8 54.9 37.8 


Note. The event number of the four groups is given in parentheses. V, is the 
initial shock speed; 7 is the proxy piston driving time duration; Veme is the 
linear speed of CMEs within LASCO’s FOV; AWewme is the sky-plane width of 
the CMEs, which is typically measured in the C2 FOV after the width becomes 
stable; acme is the acceleration of CMEs; Vsw is the background solar wind 
speed; 4) is the propagation angle of the shock relative to the Earth. All of the 
parameters are the averaged values of the corresponding group. 


location on the Sun. The outputs of SPM2 determine whether a 
shock will reach the Earth and corresponding arrival time if 
it does. 


4. INFLUENCE OF CME’S CHARACTERISTICS ON 
ARRIVAL OF THE ASSOCIATED SHOCK 


4.1. With/Without CMEs 


The combination of GP I and GP III provides 440 Type II 
burst events associated with CMEs, 171 of which have 
corresponding IP shocks reaching Earth; for the other 269 
events, their IP shocks failed to reach Earth. The shock-arrival 
(at Earth) events account for 39% of the total events. Similarly, 
the combination of GP II and GP IV provides 58 Type II burst 
events with no associated CMEs, of which only 11 events have 
IP shocks arriving at Earth; the percentage of shock-arrival 
events is only 19%. We can see that the Earth-reaching 
probability for the CME-associated shocks is twice that of the 
non-CME-associated shocks. 

In order to better demonstrate statistically the influence of a 
CME on a shock’s arrival, Table 1 provides the mean values of 
the initial parameters for these four groups of events. In the 
following, V,,; stands for the adjusted initial shock speed (i.e., 
VŽ in Zhao & Feng 2014). 7 is the proxy piston driving time 
duration. Here, the proxy piston driving time duration T is 
defined as the interval between the two half-peak points of the 
GOES X-ray flux of the associated flare. It denotes the duration 
of the X-ray over half-peak flux. Vemp is the CME speed 
obtained by fitting a straight line to the height-time measure- 
ments. Thus, it is an average speed within LASCO’s field of 
view (FOV). AWcme is the angular width of the CME projected 
on the sky-plane of SOHO. The angular width is typically 
measured in the FOV of LASCO/C2 after the width becomes 
stable. acme is the CME’s acceleration. Vsw is the background 
solar wind speed. W% is the propagation direction angle of the 
shock, which represents the angular distance between the 
shock’s main propagation direction and the Sun—Earth line. All 
of the parameters listed in Table 1 are the averaged values of 
the corresponding group. 

We quantitatively compare the parameters of the four groups 
as listed in Table 1. For GP I and GP II, all of their shocks 
reached Earth. GP I events are associated CMEs, but GP II 
events are not. The shock speeds of GP I are faster than those 
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of GP II. The piston driving times of GP I are also longer than 
those of GP II. The background solar wind speeds of the two 
groups have no apparent differences. GP I shocks have larger 
direction angles than GP II shocks. It can be seen that solar 
disturbances associated with CMEs are stronger (faster, longer 
duration) than those with no CMEs. Meanwhile, the smaller 
direction angles of the latter group would help their shocks to 
reach Earth. 

GP I and GP III are all associated with CMEs. The shocks of 
GP I arrived at Earth, but those of GP III did not. The speeds 
(both shock and CME) of GP I are faster than those of GP III. 
The durations of GP I are also longer than those of GP III. The 
mean value of the CMEs’ angular width for GP I (262°) is 
nearly twice that of GP III (142°). As for acme and Vsw, no 
significant differences are found between them. The direction 
angles of GP I are smaller than those of GP III. It follows that 
faster and wider disturbances have a higher probability of 
reaching Earth, but the larger propagation directions away from 
the Sun—Earth line would increase the disturbance’s probability 
of missing the Earth. 

For GP II and GP IV, none of the shocks are associated with 
CMEs. The shocks of GP II arrived at Earth, but those of GP IV 
did not. The parameters of initial shock speed, time duration, 
and background solar wind speed for GP IV events are similar 
to those of GP II events. However, the direction angles of GP 
IV are larger than those of GP II. It seems that the greater 
propagation directions of GP IV shocks weaken their 
probability of reaching Earth. 

For GP III and GP IV, no shocks arrived at Earth. GP III 
events are associated with CMEs, but GP IV events are not. 
The initial shock speeds for GP III are slightly faster than those 
of GP IV. The durations of GP II shocks are also longer than 
those of GP IV. The background solar wind speeds of the two 
groups are similar. The direction angles of GP III are obviously 
larger than those of GP IV. That is to say, these solar distances 
associated with CMEs are more violent. However, the direction 
of their propagation leads to their IP shocks missing the Earth. 


4.2. CME Speed 


Propagation speed is one of the most important factors 
determining the arrival times of solar transients. For each of the 
440 Type II-CME events (GP I + GP III), we have two speeds, 
i.e., the initial shock speed V,; and the CME speed Veme. These 
two speeds are obtained through different mechanisms, and 
have their own limitations and applications. Vcmeg is measured 
by the coronagraph (LASCO) on the sky-plane perpendicular 
to the Sun—Earth line for SOHO, and therefore it does not 
represent the propagation speed of the CME in Earth’s 
direction. V,; is computed from the drifting speed of a Type 
II burst as the shock propagates from the high-density corona to 
the low-density IP medium. The shock speed derived in this 
way thus lacks directional information. Voy is often used in 
empirical models to predict the arrival times of CMEs and their 
related shocks. Meanwhile, Vs; is widely used in physics-based 
models to predict the arrival of IP shocks. Figure 2 displays the 
variation of Vcme plotted versus V,; for the 440 events. GP I 
events are denoted as asterisks and GP III events as pluses. The 
dashed line is the line to fit the data points for both GP I and GP 
II. The correlation coefficient (CC) between the two speeds is 
0.55, which demonstrates a generally good correlation between 
the two speeds. The corresponding shock speed is usually large 
(small) for fast (slow) CMEs. The dotted line represents 
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Figure 2. Distribution of CME speeds Vcme plotted against initial shock 
speeds V,; for GP I and GP III events. The dashed line denotes the line fitting to 
the data points. The dotted line indicates that Vcme = Vi. 


Vi = Vome. It can be seen that Vi > Vcme for 319 of the total 
440 Type II-CMEs, and the average of Vi/Vome is 1.7. This 
means that the shock speed estimated from Type II radio bursts 
is usually larger than the CME speed obtained from 
coronagraph observations. This is consistent with the findings 
of Sun et al. (2002a). The reasons for this discrepancy could be 
as follows. On one hand, the Vcmeg used here is the projected 
speed of CMEs on the sky-plane of SOHO, which is usually 
less than the radial speed of CMEs. On the other hand, Veme is 
actually the averaged speed within LASCO’s FOV. Fast CMEs 
will undergo decelerations during this stage. The averaged 
speed obtained in this way would be less than the initial speed 
(like V,;) in these cases. 

We then investigate how well V,; and Vcmeg correlate with 
the shock’s TT at Earth. As no IP shock was observed at Earth 
for GP III events, only GP I events are analyzed. Figure 3 
shows the variation of TT plotted versus V,; (Figure 3(a)), 
versus Vcme (Figure 3(b)), and the CCs between them for 171 
GP I events. The dashed line is the line to fit to data points. It 
follows that CC between V,; and TT is —0.36, and CC between 
Vome and TT is —0.35. Their correlations are very similar. In 
other words, V, and Vcmg are equivalent as far as the 
prediction of TT is concerned. We try to use their combination 
to construct a better input speed in the revised model. The 
following procedures are similar to the “parameter train” of the 
SPM2 model (Zhao & Feng 2014). First, we assume all of the 
other input parameters as well as the arrival times for these 171 
GP I events, except for the input speed V;. The SPM2 model is 
conversely used to compute the “expected” initial speed (Vexp) 
for each event. Then, we use the linear combination of V,; and 
VomME to fit Vexp: 


Vexp = Co + Vi + C2 Vee. (7) 


For 171 GP I events, the least-squares fit to this two-variable 
linear regression equation can determine the coefficients co, C1, 
and c32: 
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Figure 3. Variation of shock’s transit times to Earth TT plotted against initial 
shock speeds V,; (a) and against CME speeds Vcme (b) for GP I events. The 
dashed line denotes the line fitting to data points. 


In this manner we obtain co = 230.5, cy = 0.559, and 
Cy = 0.274. Therefore, the following equation will be adopted 
to compute the input speed V; in the new model: 


V; = 230.5 + 0.559V + 0.274Vome. (9) 


In addition, we also give the one-variable linear fit between 
Vexp and Veme: 


V; = 391.1 + 0.70VomE (10) 


which is derived in the same way as Equation (9) and can be 
used for CME-associated events without Type I bursts (not for 
this study). 


4.3. CME Angular Width 


Angular width is another important parameter for CMEs in 
coronagraph observations. Although it is only a two-dimen- 
sional (2D) angular width projected on the sky-plane of the 
SOHO spacecraft, AWcmęg can at least partially reflect how 
wide a CME is. CMEs are called partial halo CMEs if 
120° < AWeme < 360° and full halo CMEs if AWcme = 360°. 
We investigate the correlation between AWcme and TT for 171 
GP I events. We find that CC is only —0.17, indicating only a 
very weak correlation between AWcmeg and TT. Therefore, 
AWcme may make few contributions to the TTs of the 
associated shock. 

On the other hand, the angular width of a CME is an 
important factor determining whether or not a CME encounters 
Earth (e.g., Shen et al. 2014). AWcme should contribute to the 
shock’s arrival as wider disturbances have a greater chance of 
reaching Earth. Figure 4 shows the number frequency distribu- 
tion along AWcme for GP I (top) and GP III (bottom) events. 
The distributions are different for these two groups. For 171 GP I 
events, 97 events are associated with full-halo CMEs 
(AWcme = 360°), accounting for 56.7%; the average AWom is 
262° for these 171 CMEs, as shown by the vertical dashed line. 
Meanwhile, for 269 GP III events, the number of full-halo CMEs 
is 42, accounting for only 15.6%; the peak of the event number 
distribution is located near AWcmg = 60° and the average 
AWcme is 142° for these GP III events, as shown by the vertical 
dashed line. In other words, in a statistical sense, GP I CMEs are 
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Figure 4. Number frequency distribution along the CME angular width 
AWcme for GP I events (up) and GP III events (bottom). The dashed lines 
denote the mean values of AWcme for the two groups. 


significantly wider than GP II CMEs. This proves the 
hypothesis that the width of a CME may potentially contribute 
to the associated shock’s arrival at Earth. 


4.4. Judgements for Shock Arrival at Earth 


A judgement index is needed in a prediction model to predict 
whether or not an IP shock will encounter Earth. In SPM2, 
ESSI is adopted as the judgement index. When ESSI > ESSI,, 
the shock is predicted to reach Earth; otherwise, the shock is 
predicted to miss Earth. ESSI,, is the empirically prescribed 
threshold value of ESSI. We know from the above analysis that 
a CME’s angular width is also an important parameter 
influencing the associated shock’s arrival at Earth. Therefore, 
we can use the combination of ESSI and AWcme to predict the 
shock’s arrival for CME-associated events (GP I + GP HI) in 
order to improve the prediction success rate. After a series of 
tests, the double standard, as follows, gives the best prediction. 
When ESSI > ESSI,,; and AWcme > AW,, the shock is 
predicted to arrive at Earth; otherwise, the shock is predicted to 
miss the Earth. Here, ESSI,,; = 1.53, AW, = 121°. Figure 5 
displays the scattered plot between ESSI and AWcme for 171 
GP I events (solid circles) and 269 GP III events (hollow 
circles). It can be seen from Figure 5 that solar events with 
shock arrival at Earth generally have larger ESSI or AWcme. 
However, there is still a large proportion of events with large 
ESSI or AWcmeg which are not followed by shock arrivals at 
Earth. Therefore, a single standard of ESSI or AWcme would 
usually inevitably lead to a certain amount of false alarms or 
misses. The double standard, on the other hand, would reduce 
the number of wrong predictions, as shown by the dashed- 
dotted lines. In Figure 5, the shocks in region I are predicted to 
reach Earth, and those in region II are predicted to miss Earth. 

For 58 Type Il-non CME events (GP II + GP IV), no CME 
angular width can be adopted in the judgement index. 
Therefore, ESSI becomes the only parameter used to predict 
a shock’s arrival at Earth. Figure 6 demonstrates the 
distribution of the prediction success rates for these 58 events 
along ESSI,,2. The solid curve in this figure denotes the success 
rate of W-shock events, the dotted curve denotes that of W/O- 
shock events, and the dashed—dotted curve denotes that of all 
events. As opposed to SPM2, we do not select the intersection 
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Figure 5. Scattered plot between ESSI and AWeye for 171 GP I events (solid 
circles) and 269 GP III events (hollow circles). The dashed—dotted lines 
represent the threshold values of ESSI,, = 1.53 and AWy = 121°. 
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Figure 6. Prediction success rates for 58 Type II-non CME events (GP II + GP 
IV) plotted vs. the threshold values of ESSI used to predict whether or not a 
shock will encounter Earth. The solid curve denotes the success rate of the 
W-shock events, the dotted curve denotes that of the W/O-shock events, and 
the dashed—dotted curve denotes that of all events. The vertical dashed line 
represents ESSIiy2 = 2.65. 


point of these curves as the final ESSI,,2. Instead, we take 
ESSI,y2 = 2.65, shown as the vertical dashed line, in order to 
increase the prediction success rate of the total events. This 
ESSIy2 gives 3 “hit” predictions for 11 GP II events, 39 
“correct null” (cn) predictions for 47 GP IV events, and the 
total prediction success rate for these 58 events is 72% (42/58). 


5. SPM3 AND ITS PREDICTION RESULTS 
5.1. SPM3 


According to the above analysis, the new SPM3 model is 
defined as follows. For solar events associated with CMEs, 
both the initial shock speed V,; and CME speed Veme are used 
to compute the initial speed of the disturbance V; based on 
Equation (9). This initial speed V;, together with other input 
parameters, is used to compute the shock energy according to 
E, = ©-Wew +d) 
0 Aug 
at Earth’s orbit and the corresponding ESSI are derived based 


on Equations (4) and (5). If ESSI > ESSI,,; = 1.53 and 


. Then, the disturbance’s propagation speed 
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Table 2 
Forecast Contingency Table of Hits (a), False Alarms (b), Misses (c), and Correct Nulls (d) for the Prediction Results of SPM3, SPM2 Compared With Those of 
HAFv.2, STOA, ISPM Based on the Events of Solar Cycle 23 (1997.2—2006.12) 


SPM3 SPM2 HAFv.2 SPM3 SPM2 STOA SPM3 SPM2 ISPM 
Forecast Prediction Prediction Prediction Prediction Prediction Prediction Prediction Prediction Prediction 
Observation Yes No Yes No Yes No Yes No Yes No Yes No Yes No Yes No Yes No Yes No 
Yes a c 122 60 112 70 139 43 99 52 97 54 119 32 86 51 88 49 70 67 
No b d 90 226 124 192 186 130 68 199 102 165 170 97 62 176 92 146 88 150 
Total a+b ctd 212 286 236 262 325 173 167 251 199 219 289 129 148 227 180 195 158 217 


Note. This table is transposed from the format adopted by Smith et al. (2000), Fry et al. (2001, 2003), and McKenna-Lawlor et al. (2006) in order to facilitate the 
comparison of different models. Comparisons between SPM3, SPM2, and HAFv.2 are based on 498 events, between SPM3, SPM2, and STOA based on 418 events, 


and between SPM3, SPM2, and ISPM based on 375 events. 


AWcme > AW, = 121°, then the corresponding shock is 
predicted to be able to reach Earth, and its TT is predicted 
based on Equation (6); otherwise, the shock is predicted not to 
reach Earth. For solar events associated with no CMEs, the 
shock speed V„ is used as the disturbance’s initial speed to 
compute ESSI at Earth’s location. If ESSI > ESSI,y2 = 2.65, 
then the shock is predicted to reach Earth and its transit time is 
obtained according to Equation (6); otherwise, the shock is 
predicted to miss Earth. The inputs to SPM3 are the same as 
those to SPM2, in addition to the speed and angular width of 
the associated CME in LASCO observations. The outputs of 
SPM3 include whether shocks will encounter Earth and their 
corresponding arrival times. 


5.2. Prediction Results, Verification, and Statistics 


Fundamental definitions of the prediction results are adopted 
in SPM2 and the Fearless Forecasts models to evaluate the 
efficiency of a prediction model. These definitions are as 
follows: “Hit” (h) when a shock is both predicted to arrive at 
Earth and is actually observed within +24 hr of the predicted 
time; “Miss” (m) when a shock is observed but not predicted to 
arrive, or the predicted arrive time deviates 24 hr away from the 
real one; “False alarm” (fa) when no shock is observed 
1-5 days after the solar event but the model predicts it to arrive; 
and (cn) when a shock is neither observed 1-5 days after the 
solar event nor predicted to arrive. In this manner, the 
prediction of SPM3 for 498 sample events yields 122 hits, 90 
false alarms, 60 misses, and 226 cns. In comparison, the 
prediction of SPM2 yields 112 hits, 124 false alarms, 70 
misses, and 192 cns, and the prediction of HAFv.2 yields 139 
hits, 186 false alarms, 43 misses, and 130 cns for these 498 
events. Table 2 demonstrates the forecast 2 x 2 contingency 
table for the prediction results of SPM3, SPM2 compared with 
those of HAFv.2, STOA, and ISPM, respectively. This 
contingency table, usually adopted to evaluate meteorological 
models, has been transposed to list side by side the prediction 
results of different models (Schaefer 1990; Smith et al. 2000; 
Fry et al. 2001, 2003). Here a, b, c, and d are the event numbers 
of h, fa, m, and cn, respectively. The total event number is 
given by N = a+b-+c-+d, which is 498 for the SPM3, SPM2, 
and HAFv.2 models. However, STOA was only available for 
418 events, and ISPM was only available for 375 of the 498 
total events in the Fearless Forecast predictions. In order to 
demonstrate the statistics more reasonably, this table lists the 
prediction results of SPM3, SPM2 compared with those of 
HAFv.2, STOA, and ISPM based on the same data samples, 
respectively. We know from the table that both the numbers of 
hits and cns for SPM3 are larger than those for SPM2. The total 


number of successful predictions (a+d) is 348, 304, and 269 
for SPM3, SPM2, and HAFV.2. The prediction of SPM3 gives 
the maximum number of successful predictions, which is 44 
more than that of SPM2 and 79 more than that of HAFv.2. 
Similarly, the prediction for SPM3 provides 298 successes for 
418 STOA events, 82 more than that for STOA (216) and 262 
successes for 375 ISPM events, 42 more than that for ISPM 
(220). The successful prediction numbers of SPM2 lie between 
those of SPM3 and HAFv.2, STOA, and ISPM. 

In order to further evaluate a model’s prediction efficiency, a 
series of statistical forecast skill scores are derived from the 
contingency table. These scores are as follow: (1) PODy = a/(a 
+c), probability of detection yes; (2) PODn d/(b+d), 
probability of detection no; (3) FAR = b/(a+b), false alarm 
ratio; (4) BIAS = (a+b)/(a+c); (5) CSI = a/(a+b+c), critical 
success index; (6) TSS = PODy + PODn —1, true skill 
statistic; (7) HSS = (a+d-C1)/(N-C1), Heidke skill score, and 
Cl C2 +(b+d)(c+d)/N, C2 (a+c)(at+b)/N; (8) 
GSS = (a-C2)/(a+b+c-C2), Gilbert skill score; (9) SR = (a 
+d)/N, success rate. More details about these skill scores can 
be found in Schaefer (1990), Mozer & Briggs (2003), Smith 
et al. (2009), McKenna-Lawlor et al. (2012), and Zhao & Feng 
(2014). Each skill parameter reflects the prediction capability of 
a model on a certain aspect. For example, PODy denotes the 
ratio between the hit shocks and all of the Earth-encountered 
shocks. The value of 1 for PODy indicates that all of the Earth- 
encountered shocks are correctly predicted, and 1 is referred to 
as the “ideal value” of PODy for a “perfect” model. PODn, on 
the other hand, denotes the ratio between cns and all of the 
Earth-missed events. Meanwhile, SR represents the prediction 
success rate for all of the events. 

Table 3 shows a statistical comparison of these skill scores 
derived by SPM3, SPM2 with those derived by HAFv.2, 
STOA, and ISPM based on the same events. The ideal values 
of these skill scores for a perfect prediction are also shown in 
this table for reference. For all 498 events, SPM3 performs 
better than SPM2 for all of these nine skill scores. The 
prediction success rates of SPM3 for the Earth-encountered 
shocks (PODy), the Earth-missed shocks (PODn), and the total 
shocks (SR) are 67%, 72%, and 70%, which are 5, 11, and 9 
percentage points higher than those of SPM2, respectively. 
This indicates a significant improvement in the forecast 
accuracy of SPM3 compared with the previous SPM2 model. 
Although HAFv.2 gives a greater success rate for the Earth- 
encountered events (PDDy) than SPM3 (76% versus 67%), it 
gives a much lower success rate for the Earth-missed events 
(PODn; 41% versus 72%). The seven other skills of HAFv.2 
are all worse than those of SPM3. In particular, the success rate 
for the total events (SR) of HAFv.2 is 16 percentage points 
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Table 3 
Statistical Comparison of the Values of Several Standard Meteorological Forecast Skill Scores Derived by SPM3, SPM2 With Those Derived by HAFv.2 
Based on 498 Events, With Those of STOA Based on 418 Events, and With Those of ISPM Based on 375 Events 
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Skill scores Ideal value SPM3 SPM2 HAFv.2 SPM3 
PODy 1 0.67 0.62 0.76 0.66 
PODn 1 0.72 0.61 0.41 0.75 
FAR 0 0.43 0.53 0.57 0.41 
BIAS 1 1.17 1.30 1.79 1.11 
CSI 1 0.45 0.37 0.38 0.45 
TSS 1 0.39 0.22 0.18 0.40 
HSS 1 0.37 0.21 0.15 0.39 
GSS 1 0.23 0.12 0.08 0.24 
SR 1 0.70 0.61 0.54 0.71 
a N 70.2 23.0 15.6 64.6 
p-Value 0 0 16x 10° 77x10 89x 10716 


SPM2 STOA SPM3 SPM2 ISPM 
0.64 0.79 0.63 0.64 0.51 
0.62 0.36 0.74 0.61 0.63 
0.51 0.59 0.42 0.51 0.56 
1.32 1.91 1.08 1.31 Mis] 
0.38 0.37 0.43 0.38 0.31 
0.26 0.15 0.37 0.26 0.14 
0.24 0.13 0.36 0.24 0.14 
0.14 0.07 0.22 0.14 0.07 
0.63 0.52 0.70 0.62 0.59 

26.2 10.4 49.1 22.8 TA 

3.1 x 1077 1.3 x 10 2.5 x 10-? 1.8 x 10° 7.7 x 10-3 


Note. Here N is the total number of events that the model has predictions for, which is 498 for SPM3, SPM2, and HAFv.2, 418 for STOA, and 375 for ISPM. A x? test 
with a p value is adopted to check the statistical significance of the forecasts. A larger x? value indicates a better dependence between observations and predictions, 
and a value of p < 0.05 indicates a high level of significance (better than the acceptable level when 0.05 < p < 0.2) for the forecast. 


lower than that of SPM3. A x” test with a p-Value is also used 
in this table to check the statistical significance of the forecasts. 
The larger (smaller) the x? (p) is, the better the dependence 
between observations and predictions becomes. The signifi- 
cance level of a forecast is high when p < 0.05, and is lower, 
but still acceptable, when 0.05 < p < 0.2 (McKenna-Lawlor 
et al. 2006, 2012). It is found that p < 0.05 for SPM3, SPM2, 
and HAFy.2, demonstrating the significant statistics levels for 
their prediction results. SPM3, in particular, has the best 
prediction performance as shown by the 7 test. 

Similar results are found for 418 STOA events. SPM3 
performs better than SPM2 for nine skill scores, and better than 
STOA for eight scores (except for PODy). The prediction 
success rate of SPM3 for all 418 events is 71%, which is 8 
percentage points higher than that of SPM2, and 19 percentage 
points higher than that of STOA. The x test also verifies the 
good prediction performance of STOA, the better performance 
of SPM2, and the best performance of SPM3. As for the 375 
ISPM events, SPM3 performs better than SPM2 for eight 
scores (except for PODy), and better than ISPM for all nine 
scores. The prediction success rate of SPM3 for all 375 events 
is 70%, which is eight percentage points higher than that of 
SPM2, and 11 percentage points higher than that of ISPM. The 
prediction performance of ISPM is good, the performance of 
SPM2 is better, and the performance of SPM3 is best as shown 
by the xX? test. 

To summarize, the prediction of SPM3 has the highest 
success rate and best performance compared with those of 
SPM2, HAFv.2, STOA, and ISPM. The success rate of SPM3 
for all of the events is 70%-—71%, which is evidently higher 
than that of the previous SPM2 model (61%-63%), and even 
higher than those of HAFv.2, STOA, and ISPM (52%-59%). 


5.3. The Prediction Error of TT 


The prediction error of the shock’s TT at Earth, i.e. AT, 
defined as the difference between the observed and predicted 
arrival times, is another parameter used to demonstrate the 
precision of the prediction models. The MA error and rms error 
are two commonly adopted measurements of AT. Table 4 lists 
the prediction errors of these models based on the same data 
samples. The number of hit events in computing these errors is 
also shown. Among all 498 events, the MA AT of SPM3 is 
9.01 hr for 122 hit shocks, which is less than both the 9.58 hr of 


Table 4 
Comparison of Prediction Errors of the Shock’s Transit Time 
Derived by SPM3, SPM2 with those by HAFv.2, STOA, and ISPM 
Based on the Same Data Events 


Model Hit Number MA* Error (hr) rms? Error (hr) 
SPM3 122 9.01 10.91 
SPM2 112 9.58 11.60 

HAFv.2 139 9.26 11.34 
SPM3 99 8.94 10.76 
SPM2 97 9.64 11.59 
STOA 119 10.04 11.77 
SPM3 86 8.87 10.64 
SPM2 88 9.49 11.56 
ISPM 70 8.88 10.97 

Notes. 


è MA denotes the mean-absolute value of the prediction error. 
rms denotes the root mean square value of the prediction error. 


the MA error for 112 hits (SPM2) and the 9.26 hr of MA error 
for 139 hits (HAFv.2); the rms error of SPM3 is 10.91 hr, 
which is also less than the 11.60 hr of rms error of SPM2 and 
the 11.34hr of rms error of HAFv.2. Similar results are 
obtained for comparisons with STOA and ISPM. The MA error 
of SPM3 is 8.94 hr for 99 hits among 418 STOA events, which 
is less than both the 9.64 hr of the MA error of SPM2 and the 
10.04 hr of the MA error of STOA; the rms error of SPM3 
(10.76 hr) is the least among the three models. As for the 375 
ISPM events, the MA and rms errors of SPM3 are 8.87 hr and 
10.64 hr, respectively, which are less than the corresponding 
errors of SPM2 and ISPM. Briefly, the prediction errors of the 
shock TTs of SPM3 are within 9hr (MA) and 11 hr (rms), 
which are the minimum among those of a group of models 
including SPM2, HAFv.2, STOA, and ISPM. 


6. CONCLUSION AND DISCUSSION 


Based on a set of 498 solar-IP shock events during 
1997.02—2006.12, the influence of CMEs on associated shock 
arrivals at Earth are analyzed. It is found that the shocks 
associated with CMEs have a larger probability of reaching 
Earth than those not associated with CMEs. The angular width 
of the associated CME, although only a 2D projection on the 
sky-plane of the spacecraft, can be a useful index for judging 
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whether the shock will hit Earth. The CME speed, obtained 
through remote-sensing observations of coronagraphs, are still 
valuable in predicting the shock arrival time. These results lead 
to the third version of SPM3. For the CME-associated events, 
the CME speed and the initial shock speed computed from 
Type II burst data are combined to compute the initial 
propagation speed of the disturbance in SPM3. A double 
standard including both the shock strength index ESSI and 
CME angular width AWcme is adopted to predict whether or 
not the shock will hit Earth. For the non-CME-associated 
events, only the Type II speed is input into the model as the 
initial propagation speed to predict the shock’s arrival 
(including arrival time) at Earth, modifying the threshold value 
of ESSI. The prediction results of SPM3 for 498 sample events 
reveal that the success rate for all events is 70%—71%, which is 
significantly higher than that of SPM2 (61%-63%). The 
deviations of the predicted TTs of SPM3 are within 9 hr on 
the MA error and 11 hr on the rms error, which are less than 
those of SPM2. Comparisons between SPM3 and the Fearless 
Forecast models demonstrate the superiority of SPM3 as well. 

Admittedly, although the prediction success rate of SPM3 is 
greatly improved compared with the previous SPM2 model, its 
improvements in the shock TT prediction are limited. Many 
factors could potentially cause this. On one hand, the sample 
events used in this study are CMEs from Solar Cycle 23 when 
SOHO was the only spacecraft tracking their movements in the 
sky-plane. The CME speed derived in this way is the projected 
speed, which does not represent the propagation speed of the 
CME along the Sun—Earth direction. Large uncertainties in Veme 
restrict further improvements to the TT prediction of the shock. 
Possible solutions to this restriction could include adopting the 
CME’s radial speed, which is derived from models (such as 
cone models) based on single spacecraft observations (Jang 
et al. 2014). Another solution involves estimating the initial 
geometry and three-dimensional (3D) speeds of CMEs based on 
observations from multiple spacecraft (STEREO, SOHO; Kilpua 
et al. 2012; Gopalswamy et al. 2013; Lee et al. 2013). On the 
other hand, the input parameters used in this study were obtained 
when the disturbances propagated near the Sun. Therefore, the 
lead time of SPM3’s prediction is very long, nearly the whole TT 
of the disturbance from the Sun to Earth, as the model is analytic 
and thus requires no running time. Models based on heliospheric 
image data (STEREO HIs and SMED) can provide more accurate 
predictions for arrival times but with shorter lead times 
(Colaninno et al. 2013; Mishra & Srivastava 2013; Mostl 
et al. 2014). For example, Webb (2013) applied the Tapping- 
Howard model (Tapping & Howard 2009) to predict the arrival 
time at Earth of the 2011 February 15 CME event based on HI 
and/or SMEI observations, and the corresponding prediction 
accuracy could be within an hour. However, the prediction’s 
lead time was only several hours. Kilometric Type II radio burst 
emission can also be used to track shock dynamics in the inner 
heliosphere and provide shock arrive time predictions (Corona- 
Romero et al. 2013; Xie et al. 2013). Further prediction models 
considering these factors should be developed based on the 
events of Solar Cycle 24. This is the next goal of our research. 
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